Abstract

In high-frequency financial data not only returns, but also waiting times between 
consecutive trades are random variables. Therefore, it is possible to apply continuous- 
time random walks (CTRWs) as phenomenological models of the high-frequency 
price dynamics. An empirical analysis performed on the 30 DJIA stocks shows that 
the waiting-time survival probability for high-frequency data is non-exponential. 
This fact sets limits for agent-based models of financial markets.

1 Introduction



Recently, various authors have discussed the application of the continuous- 
time random walk (CTRW) [1] to finance (see Sec. 2 for details). The basic 
point is that, in financial markets, not only returns, but also waiting times 
between two consecutive trades can be considered as random variables. To 
our knowledge, the application of CTRW to economics dates back, at least,
to the 1980s. In 1984, Rudolf Hilfer published a book on the application of
stochastic processes to operational planning, where CTRWs were used for sale 
forecasts [2]. The (revisited) CTRW formalism has been applied to the high- 
frequency price dynamics in financial markets by our research group since 
2000, in a series of three papers [3,4,5]. Other scholars have recently used this 
formalism [6,7,8]. However, CTRWs have a famous precursor. In 1903, the 
PhD thesis of Filip Lundberg presented a model for ruin theory of insurance 
companies, which was further developed by Cramér [9,10]. The underlying
stochastic process of the Lundberg-Cramér model is a particular example of
a CTRW.

Among other issues, we have studied the independence between log-returns 
and waiting times for the 30 Dow Jones Industrial Average (DJIA) stocks
traded at the New York Stock Exchange in October 1999. For instance, according
to a contingency-table analysis performed on General Electric (GE) prices,
the null hypothesis of independence can be rejected at the 1 % significance
level [11]. In this letter, however, the focus is on the empirical distribution
of waiting times. 

From our study, it turns out that the marginal density for waiting times
is definitely not an exponential function. After the publication of our paper
series, different waiting-time scales have been investigated in different
markets by various authors. All these empirical analyses corroborate the
anomalous behaviour of waiting times. A study on the waiting times in a
contemporary FOREX exchange and in the XIXth century Irish stock market was
presented by Sabatelli et al. [12]. They were able to fit the Irish data by
means of a Mittag-Leffler function, as we did before in a paper on the
waiting-time marginal distribution in the German Bund futures market [4].
Kyungsik Kim and Seong-Min Yoon studied the tick dynamical behavior of bond
futures in the Korean Futures Exchange (KOFEX) market and found that the
survival probability displays a stretched-exponential form [13]. Moreover,
just to stress the relevance of non-exponential waiting times, a power-law
distribution has recently been detected by T. Kaizoji and M. Kaizoji in
analyzing the calm time interval of price changes in the Japanese market [14].



2 Theory 



The importance of random walks in finance has been known since the seminal
thesis of Bachelier [15], which was completed at the end of the XIXth century,
more than a hundred years ago. The ideas of Bachelier were further developed
by many scholars [16,17].

The price dynamics in financial markets can be mapped onto a random walk
whose properties are studied in continuous, rather than discrete, time [17].
Here, we shall present this mapping, pioneered by Bachelier [15], in a rather 
general way. It is worth mentioning that this approach is related to those of 
Clark [19] and to the introductory notes in Parkinson's paper [20]. As a further
comment, this is a purely phenomenological approach. No specific assumption
on the rationality or the behaviour of market agents is made or even necessary.
In particular, it is not necessary to assume the validity of the efficient market 
hypothesis [21,22]. Nonetheless, as shown below, a phenomenological model 
can be useful in order to empirically corroborate or falsify the consequences 
of behavioural or other assumptions on markets. In its turn, the model itself 
can be corroborated or falsified by empirical data. 

As a matter of fact, there are various ways in which a random walk can be
embedded in continuous time. Here, we shall base our approach on the so- 
called continuous-time random walk in which time intervals between successive 
steps are random variables, as discussed by Montroll and Weiss [1]. 

Let S(t) denote the price of an asset or the value of an index at time t. In a 
real market, prices are fixed when supply and demand meet and a transaction
occurs. In this case, we say that a trade takes place. It is more convenient
to work with returns than with prices. For this reason, we shall take into
account the variable $x(t) = \log S(t)$, that is the logarithm of the price.
Indeed, for a small price variation $\Delta S = S(t_{i+1}) - S(t_i)$, the
return $r = \Delta S / S(t_i)$ and the logarithmic return
$r_{\log} = \log[S(t_{i+1}) / S(t_i)]$ virtually coincide.
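
As a quick numerical check of the statement above, the following minimal sketch compares the two definitions of return for a small price change. The two prices (100.00 and 100.05) and the function names are illustrative assumptions, not data from the paper. Since $\log(1 + r) = r - r^2/2 + \dots$, the difference between the two returns is expected to be of order $r^2/2$.

```python
import math


def simple_return(s_prev, s_next):
    """Return r = (S(t_{i+1}) - S(t_i)) / S(t_i)."""
    return (s_next - s_prev) / s_prev


def log_return(s_prev, s_next):
    """Logarithmic return r_log = log[S(t_{i+1}) / S(t_i)]."""
    return math.log(s_next / s_prev)


# Hypothetical consecutive trade prices (not taken from the paper).
s_prev, s_next = 100.00, 100.05

r = simple_return(s_prev, s_next)
r_log = log_return(s_prev, s_next)

print(f"simple return      r     = {r:.9f}")
print(f"logarithmic return r_log = {r_log:.9f}")
print(f"difference               = {r - r_log:.3e}")
```

For tick-by-tick price changes of this size the two quantities are interchangeable for practical purposes, which is why the text works with $x(t)$ directly.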

As we mentioned before, in financial markets, not only prices can be modelled 
as random variables, but also waiting times between two consecutive
transactions vary in a stochastic fashion. Therefore, the time series
$\{x(t_i)\}$ is characterised by $\varphi(\xi, \tau)$, the joint probability
density of log-returns $\xi_i = x(t_{i+1}) - x(t_i)$ and of waiting times
$\tau_i = t_{i+1} - t_i$. The joint density satisfies the normalization
condition $\int\!\!\int d\xi\, d\tau\, \varphi(\xi, \tau) = 1$.

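Montroll and Weiss [1] have shown that the Fourier-Laplace transform of
$p(x,t)$, the probability density function (pdf) of finding the value $x$ of
the price logarithm (which is the diffusing quantity in our case) at time $t$,
is given by:

$$\tilde{\hat{p}}(\kappa, s) = \frac{1 - \tilde{\psi}(s)}{s}\,\frac{1}{1 - \tilde{\hat{\varphi}}(\kappa, s)}, \qquad (1)$$

where

$$\tilde{\hat{p}}(\kappa, s) = \int_0^{+\infty} dt \int_{-\infty}^{+\infty} dx\, e^{-st + i\kappa x}\, p(x,t), \qquad (2)$$

and $\psi(\tau) = \int d\xi\, \varphi(\xi, \tau)$ is the waiting time pdf.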
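
To illustrate how the Montroll-Weiss relation of eq. (1) is used, the following worked example treats the simplest special case. It rests on two assumptions that are made here only for illustration and are not claims of the paper: log-returns and waiting times are independent, so that the joint density factorizes, and waiting times are exponential with rate $\mu$. The symbols $\lambda(\xi)$ (jump density) and $\mu$ are introduced for this example only. Under these assumptions the CTRW reduces to a compound Poisson process.

```latex
% Assumption (illustration only): independent jumps and waiting times,
% \varphi(\xi,\tau) = \lambda(\xi)\,\psi(\tau), with exponential waiting times.
\begin{align*}
\psi(\tau) &= \mu\, e^{-\mu \tau}
  \quad\Longrightarrow\quad
  \tilde{\psi}(s) = \frac{\mu}{s + \mu}, \qquad
  \tilde{\hat{\varphi}}(\kappa, s) = \hat{\lambda}(\kappa)\,\frac{\mu}{s + \mu}, \\
\tilde{\hat{p}}(\kappa, s)
  &= \frac{1 - \frac{\mu}{s + \mu}}{s}\,
     \frac{1}{1 - \hat{\lambda}(\kappa)\,\frac{\mu}{s + \mu}}
   = \frac{1}{s + \mu\,[1 - \hat{\lambda}(\kappa)]}, \\
\hat{p}(\kappa, t) &= \exp\{-\mu t\,[1 - \hat{\lambda}(\kappa)]\}.
\end{align*}
```

The last line is the characteristic function of a compound Poisson process. Any departure of the waiting-time density from the exponential form changes $\tilde{\psi}(s)$ and therefore the whole solution.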

The space-time version of eq. (1) can be derived by probabilistic considerations 
[4]. The following integral equation gives the probability density, $p(x,t)$, for
the walker being in position $x$ at time $t$, conditioned by the fact that it was
in position $x = 0$ at time $t = 0$:

$$p(x,t) = \delta(x)\,\Psi(t) + \int_0^t \int_{-\infty}^{+\infty} \varphi(x - x', t - t')\, p(x', t')\, dt'\, dx', \qquad (3)$$

where $\Psi(t)$ is the so-called survival function, which is related to the
marginal waiting-time probability density $\psi(\tau)$. The survival function
$\Psi(\tau)$ is:

$$\Psi(\tau) = 1 - \int_0^{\tau} \psi(\tau')\, d\tau' = \int_{\tau}^{\infty} \psi(\tau')\, d\tau'. \qquad (4)$$

The CTRW model can be useful in applications such as speculative option 
pricing by Monte Carlo simulations or portfolio selection. This will be the 
subject of a forthcoming paper. 
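
To make the Monte Carlo use of the model concrete, here is a minimal sketch of how a single CTRW path could be simulated. It assumes, purely for illustration, that log-returns and waiting times are drawn independently, that waiting times follow a Weibull law (scale 8 s, shape 0.7), and that log-returns are Gaussian with standard deviation 1e-4. None of these choices is taken from the paper, and the function names are hypothetical.

```python
import bisect
import itertools
import random


def simulate_ctrw(n_trades, draw_wait, draw_jump, x0=0.0):
    """Return trade times t_i and log-prices x(t_i) for one CTRW path."""
    waits = [draw_wait() for _ in range(n_trades)]
    jumps = [draw_jump() for _ in range(n_trades)]
    # t_0 = 0 and t_{i+1} = t_i + tau_i; x(t_{i+1}) = x(t_i) + xi_i.
    times = list(itertools.accumulate(waits, initial=0.0))
    xs = list(itertools.accumulate(jumps, initial=x0))
    return times, xs


def log_price_at(times, xs, t):
    """x(t) is piecewise constant: it keeps the value set by the last trade
    at or before time t."""
    return xs[bisect.bisect_right(times, t) - 1]


rng = random.Random(0)
times, xs = simulate_ctrw(
    1000,
    # Assumed waiting-time law: Weibull with scale 8 s and shape 0.7.
    draw_wait=lambda: rng.weibullvariate(8.0, 0.7),
    # Assumed log-return law: Gaussian with zero mean, std 1e-4.
    draw_jump=lambda: rng.gauss(0.0, 1e-4),
)

print(f"simulated {len(times) - 1} trades over {times[-1]:.1f} s")
print(f"log-price after 60 s: {log_price_at(times, xs, 60.0):.6f}")
```

Passing the two samplers as arguments keeps the path construction separate from the distributional assumptions, so an empirically fitted waiting-time law could be substituted without touching the rest of the sketch.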



3 Empirical evidence 

In Fig. 1, the waiting-time complementary cumulative distribution function (or
survival function) $\Psi(\tau) = 1 - \int_0^{\tau} \psi(t')\, dt'$ is plotted
for three different periods of the day (morning 9:00 - 10:59, midday 11:00 -
13:59, afternoon 14:00 - 17:00) for the GE time series of October 1999. In the
above formula, $\psi(\tau)$ represents the marginal waiting-time probability
density function, and $\Psi(\tau)$ gives the probability that the waiting time
between two consecutive trades is greater than the given $\tau$. The lines are
the corresponding standard exponential complementary cumulative distribution
functions: $\Psi(\tau) = \exp(-\tau/\tau_0)$, where $\tau_0$ is the empirical
average waiting time. Visual inspection already shows the deviation of the
real distribution from the exponential distribution. This fact is corroborated
by the Anderson-Darling test [24]. The Anderson-Darling (AD) $A^2$ values for
the three daily periods are, respectively, 352, 285, and 446. Therefore, the
null hypothesis of exponential distribution can be rejected at the 5 %
significance level.
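
The analysis described above can be sketched as follows. This is a minimal illustration on synthetic data, not the authors' code. It computes the empirical survival function and the standard Anderson-Darling statistic $A^2 = -n - \frac{1}{n}\sum_{i=1}^{n}(2i-1)\,[\ln F(\tau_{(i)}) + \ln(1 - F(\tau_{(n+1-i)}))]$ against an exponential law $F(\tau) = 1 - \exp(-\tau/\tau_0)$, with $\tau_0$ estimated by the sample mean. The sample sizes, the seed, and the Weibull alternative (shape 0.6) are assumptions chosen only to show the contrast.

```python
import math
import random


def empirical_survival(waits):
    """Return pairs (tau, fraction of waiting times greater than tau)."""
    ts = sorted(waits)
    n = len(ts)
    return [(t, 1.0 - (i + 1) / n) for i, t in enumerate(ts)]


def anderson_darling_exponential(waits):
    """A^2 statistic against an exponential law with mean tau0 estimated
    from the sample. Returns (A^2, tau0)."""
    ts = sorted(waits)
    n = len(ts)
    tau0 = sum(ts) / n
    total = 0.0
    for i, t in enumerate(ts, start=1):
        log_cdf = math.log(-math.expm1(-t / tau0))  # ln F(tau_(i))
        log_surv = -ts[n - i] / tau0                # ln[1 - F(tau_(n+1-i))]
        total += (2 * i - 1) * (log_cdf + log_surv)
    return -n - total / n, tau0


rng = random.Random(42)
n = 5000
# Synthetic waiting times (assumptions, not market data).
exp_waits = [rng.expovariate(1.0 / 7.0) for _ in range(n)]
weib_waits = [rng.weibullvariate(7.0, 0.6) for _ in range(n)]

a2_exp, tau0_exp = anderson_darling_exponential(exp_waits)
a2_weib, tau0_weib = anderson_darling_exponential(weib_waits)
survival = empirical_survival(weib_waits)

print(f"exponential sample: tau0 = {tau0_exp:.2f} s, A^2 = {a2_exp:.2f}")
print(f"Weibull sample:     tau0 = {tau0_weib:.2f} s, A^2 = {a2_weib:.2f}")
```

For genuinely exponential samples $A^2$ stays of order one, whereas a stretched-exponential sample of comparable size gives values in the hundreds, the same order of magnitude as those reported for GE.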

In Tab. 1, the values of the AD $A^2$ statistic are given for the 30 DJIA stocks
traded in October 1999. In all these cases the null hypothesis of exponentiality 
can be rejected at the 5 % significance level. 



4 Conclusions and discussion 

Why should we care about these empirical findings on the waiting-time
distribution? This has to do both with the market price formation mechanisms
and with the bid-ask process. A priori, one could argue that there is no strong
reason for independent market investors to place buy and sell orders in a 
time-correlated way. This leads to a Poisson process with exponential survival 
function for the bid-ask process. If price formation were a simple thinning of 
the bid-ask process, then exponential waiting times should be expected
between consecutive trades as well [23]. Even if empirical analyses were to
show that time correlations are already present at the bid-ask level, it
would be interesting to understand why they are there. In other words, the
empirical results on the survival probability set limits on statistical market 
models for price formation. 
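
The thinning argument above can be checked numerically. The sketch below assumes that orders arrive as a Poisson process with rate 1 per second and that each order independently results in a trade with probability 0.3. Both numbers, the independence of the thinning, and the function name are illustrative assumptions. Under these assumptions the waiting times between trades should again be exponential, with mean 1/(rate × probability) and coefficient of variation equal to 1.

```python
import random
import statistics


def thinned_waiting_times(order_rate, trade_prob, n_orders, rng):
    """Simulate Poisson order arrivals, keep each one as a trade with
    probability trade_prob, and return the waiting times between trades."""
    t = 0.0
    last_trade = None
    waits = []
    for _ in range(n_orders):
        t += rng.expovariate(order_rate)   # exponential inter-order time
        if rng.random() < trade_prob:      # independent thinning (assumed)
            if last_trade is not None:
                waits.append(t - last_trade)
            last_trade = t
    return waits


rng = random.Random(1)
order_rate, trade_prob = 1.0, 0.3
waits = thinned_waiting_times(order_rate, trade_prob, 200_000, rng)

mean_wait = statistics.fmean(waits)
cv = statistics.pstdev(waits) / mean_wait  # equals 1 for an exponential law

print(f"mean waiting time: {mean_wait:.3f} s "
      f"(expected {1.0 / (order_rate * trade_prob):.3f} s)")
print(f"coefficient of variation: {cv:.3f} (expected 1 for exponential)")
```

Because independent thinning of a Poisson process cannot produce the non-exponential survival functions observed in the data, either the order flow is not Poissonian or the price formation mechanism is not a simple thinning.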

Fig. 1. Waiting-time complementary cumulative distribution function $\Psi(\tau)$ for GE
trades quoted at NYSE in October 1999. Open diamonds represent $\Psi(\tau)$ for the
morning hours (9:00 - 10:59). There were 16063 trades in this period in October
1999. The solid line is the corresponding standard exponential complementary
cumulative distribution function with $\tau_0 = 7.0$ s. Open circles represent $\Psi(\tau)$ for the
period around midday (11:00 - 13:59). There were 20214 trades in this period in
October 1999. The dashed line is the corresponding standard exponential
complementary cumulative distribution function with $\tau_0 = 11.3$ s. Open squares represent
$\Psi(\tau)$ for the afternoon hours (14:00 - 17:00). There were 19372 trades in this period
in October 1999. The dash-dotted line is the corresponding standard exponential
complementary cumulative distribution function with $\tau_0 = 7.9$ s. The day was
divided into three periods to highlight seasonalities.

Table 1

For each daily period, the table gives the values of the empirical average waiting
time $\tau_0$ and the AD statistic $A^2$ [24].